Video Summarization Based on Feature Fusion and Data Augmentation

نویسندگان

چکیده

During the last few years, several technological advances have led to an increase in creation and consumption of audiovisual multimedia content. Users are overexposed videos via social media or video sharing websites mobile phone applications. For efficient browsing, searching, navigation across collections repositories, e.g., for finding that relevant a particular topic interest, this ever-increasing content should be efficiently described by informative yet concise representations. A common solution problem is construction brief summary video, which could presented user, instead full so she/he then decide whether watch ignore whole video. Such summaries ideally more expressive than other alternatives, such as textual descriptions keywords. In work, summarization approached supervised classification task, relies on feature fusion audio visual data. Specifically, goal work generate dynamic summaries, i.e., compositions parts original include its most essential segments, while preserving temporal sequence. This annotated datasets per-frame basis, wherein being “informative” “noninformative”, with latter excluded from produced summary. The novelties proposed approach are, (a) prior classification, transfer learning strategy use deep features pretrained models employed. These been used input classifiers, making them intuitive robust objectiveness, (b) training dataset was augmented using publicly available datasets. evaluated three user-generated videos, it demonstrated data augmentation able improve accuracy based human annotations. Moreover, domain independent, any extended rely richer representations modalities.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Video Content Summarization and Augmentation Based on Structural Semantic Processing and Social Network Analysis

Video summarization techniques have been proposed for years to offer people comprehensive understanding of a whole story on video. However, although these traditional methods give brief summaries for users, they still do not provide conceptorganized or structural views. Besides, the knowledge they offer to users is often limited to existing videos. In this study, we present a structural video c...

متن کامل

VIDEO CLASSIFICATION BASED ON LOW−LEVEL FEATURE FUSION MODEL (WedPmPO2)

This article presents a new system for automatically extracting high−level video concepts. The novelty of the approach lies in the feature fusion method. The system architecture is divided into three steps. The first step consists in creating sensors from a low−level (color or texture) descriptor, and a Support Vector Machine (SVM) learning to recognize a given concept (for example, "beach" or ...

متن کامل

Feature based Information Extraction for Generic Video Summarization

Video summarization plays a very significant role in navigating a video, to understand its information or to search the required event information. Our proposed research work minimizes the time required for processing each of the video frames firstly, by reducing their effective size, and then it is followed by an efficient technique for generating the summarized video. The information containe...

متن کامل

Countermeasures for Automatic Speaker Verification Replay Spoofing Attack : On Data Augmentation, Feature Representation, Classification and Fusion

The ongoing ASVspoof 2017 challenge aims to detect replay attacks for text dependent speaker verification. In this paper, we propose multiple replay spoofing countermeasure systems, with some of them boosting the CQCC-GMM baseline system after score level fusion. We investigate different steps in the system building pipeline, including data augmentation, feature representation, classification a...

متن کامل

Lawn Tennis Video Summarization based on Audiovisual and Text Feature Analysis

In this paper, a new video summarization approach for lawn tennis video is presented. The proposed method uses frame color histogram to classify video into play field color shots (PFCS) and non play field color shots (NPFCS). Play field color shots are the segments of interest and used to recognize the tournament class. A dominant colored frame from every PFCS is extracted as a salient frame. O...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computers

سال: 2023

ISSN: ['2073-431X']

DOI: https://doi.org/10.3390/computers12090186